A  SEQUENTIAL  STOCHASTIC  ASSIGNMENT  PROBLEM 
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A  SFXIUEITTIAL  STOCHASTIC  ACSIOWMENT  PROBI.EM 

by 

Cyrus  Dermarii  Gerald  J.  Llebermarii  and  Sheldon  M.  Ross 
0.  Siimir.ary 

Suppose  tnere  are  n  men  available  to  perform  n  jobs.  The  n 
jobs  occur  in  sequential  order  with  the  value  of  each  Job  being  a 
random  variable  X.  Associated  with  each  man  is  a  probability  p. 

If  a  "p"  man  is  assigned  to  an  "X  «  x"  Job,  the  (expected)  reward 
is  assumed  to  be  given  by  px.  After  a  man  is  assigned  to  a  Job, 
he  is  unavailable  for  future  assignments.  The  paper  is  concerned 
with  the  optimal  assignment  of  the  n  men  to  the  n  Jobs  so  as  to 
maximize  the  total  expected  reward.  The  optimal  policy  is  characterized, 
and  a  recursive  equation  is  presented  for  obtaining  the  necessary 
constants  of  this  optimal  policy. 

In  particular,  if  *  P2  «  optimal  choice  in 

the  initial  stage  of  an  n  stage  assignment  problem  is  to  use  p^^ 
if  X  falls  into  an  i  non -overlapping  interval  comprising  the  real 
line.  These  intervals  depend  on  n  and  the  CDF  of  X,  but  are  iride* 
pendent  of  the  p's. 

The  optimal  policy  is  also  presented  for  the  generalized  assignment 
problem,  i.e.,  the  assignment  problem  where  the  (expected)  reward  if 
a  "p"  man  is  assigned  to  an  "x"  Job  is  given  by  a  function  r(p,x). 

1.  Introduction 

The  sequential  stochastic  assignment  problem  can  be  described  as 
follows.  Suppose  there  are  n  men  available  to  perform  n  Jobs. 
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T!i’  II  arilv*.  In  Moquciitial  order,  l.e.,  firat  Job  1  appears, 

4  u 

by  Job  2,  '.'c.  Aacociated  with  the  J  (j  »  1,2,  ...  ,  n) 

J  b  1:;  u  ruiuium  vurlabk  Xj  wlilch  takea  on  the  value  Xj.  It  will 
!•■  a.niumod  that,  the  X'a  are  Independent  and  identically  distributed 
r.'ii.lom  variatiloa.  Tlila  Job  is  then  referred  to  as  a  "type  Xj" 
Job.  1 1‘  a  "porrt?ct"  man  la  assigned  to  the  type  x^  Job,  a  reward 
Xj  ia  obtnliivd  (the  type  job  ny  then  be  viewed  as  the  maximum  poten- 
tiai  value  of  a  Job).  However,  none  of  the  n  men  may  be  perfect, 
and  whenever  the  man  is  assigned  to  any  type  Xj  Job,  the 
(expectoa.l  reward  is  given  by  where  0  ^  ^ 

1  --  are  known  constants.  After  a  man  is  assigned  to  a  Job, 

he  is  unavailable  for  future  assignments.  The  problem  is  to  assign 
t.lie  n  men  to  the  n  Jobs  so  as  to  maximize  the  total  expected 
rowar'i.  An  assignment  of  men  is  equivalent  to  a  sequential  assignment 
of  the  p's  to  the  X's.  Let  a  policy  be  any  rule  for  assigning 
in-^  n  to  Jobs.  In  particular,  if  the  random  variable  ij  is  defined 
lo  be  the  man  .'identified  by  number)  assigned  to  the  J^^  arriving 
Job,  then  the  total  exp'''cted  reward  is  given  by 


E  I 


n 

2  Pi  X 

=1  J  ^ 


and  the  desired  policy  is  the  one  which  maximizes  (l).  It  should  be 
noted  that  ...  ,  i^)  is  a  random  permutation  of  the  integers 

1,2,  ...  ,  n. 


-  Actually,  the  constraint,  0  <  p^  <  1,  is  given  for  clarity  of 
application,  and  none  of  the  ensuing  results  are  dependent  upon  it. 
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There  are  other  interpretations  of  the  above  model  which  may  be 

usei'uj  to  the  reader.  Suppose  there  exists  n  cards.  Associated 

with  the  card  is  a  probability  p^.  A  sequence  of  independent 

•Ufnticaily  distributed  random  variables  •••  /  X^  are  observed 

in  a  sequential  manner.  When  the  random  variable  X.  appears,  a  card 

J 

must  b  -  chosen  and  played  on  that  random  variable.  If  the  i  card 
is  played  when  X.  =  x  is  observed,  then  the  expected  reward  is 

U  J 

given  by  An  example  of  this  form  occurs  when  Xj  is  received 

with  probability  p^^  and  zero  is  received  with  probability  1  -  p^. 

The  problem  is  to  choose  the  n  plays  of  the  cards  to  maximize  the 

total  expected  reward,  i.e.,  maximize  (1). 

Finally  a  special  case  of  this  model  is  a  generalization  of  the 

"house  hunting"  problem  ll].  Suppose  that  there  are  k  <  n  identical 

houses  to  be  sold.  Offers  arrive  in  a  sequential  manner.  These 

offers  will  be  assumed  to  be  a  sequence  of  independent  identically 

distributed  random  variables  X^,Xg,  ...  ,  X^.  The  seller  may  accept 

or  reject  the  offers  but  must  dispose  of  all  k  houses  by  no  later 

than  the  offer.  In  the  above  "card  interpretation"  let  k  of 

the  cards  have  associated  p*s  equal  to  1  and  let  (n-k)  of  the 

cards  have  associated  p's  equal  to  0.  If  the  seller  accepts  the 

offer  he  assigns  it  a  card  having  an  associated  p  equal  to  1 

and  receives  x.,  and  that  house  and  card  become  unavailable.  If 
J 

the  seller  rejects  the  J  offer  he  assigns  it  a  card  having  an  associ¬ 
ated  p  equal  to  0  and  hence  receives  nothing.  This  procedure  con¬ 
tinues  until  all  the  houses  (and  cards)  are  disposed  of.  The  problem 
is  to  determine  which  offers  to  accept  in  order  to  maximize  the  total 
expected  profit  (or  reward),  i.e.,  maximize  (l). 
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Gection  2  characterizes  the  optimal  policy,  and  presents  a  recursive 
equation  for  obtaining  the  constants  of  the  optimal  policy.  In  Section 
5,  it  is  assumed  that  the  choice  of  the  values  of  p  is  available  to 
the  decision  maker,  and  results  are  presented  for  an  optimum  allocation. 
Section  U  contains  a  detailed  example  which  illustrates  the  concepts 
presented  in  the  earlier  sections.  Finally,  Section  5  generalizes 
the  assignment  problem  to  include  the  case  where  the  (expected)  reward 
if  a  "p"  man  is  assigned  to  an  "x"  job  is  given  by  a  function 
r(p,x) , 

2.  Optimal  Policy 

The  key  result  needed  to  determine  the  optimal  policy  is  to  show 
that  it  is  of  the  following  form.  If  there  are  n  stages  to  go  (n 
men  to  assign  or  n  cards  to  play)  and  probabilities  ^  Pg  »  *  ”  «  ^n^ 
then  the  optimal  choice  in  the  initial  stage  is  to  use  p^  (implying 
using  the  i  man  or  the  i  card  in  the  appropriate  interpretation) 
if  tne  random  variable  X  falls  into  an  i  non- overlapping  interval 

comprising  the  real  line.  Furthermore.,  these  intervals  depend  on  n 
and  the  cumulative  distribution  function  of  X  but  are  independent 
of  the  p's. 

In  proving  the  main  result,  a  well  known  theorem  due  to  Hardy  [2] 
will  be  used 

Lemma  (Hardy's  Theorem).  If  <  •  •  •  <  x^  and  y^.  5  y2  =  ■  ^n 

are  sequences  of  numbers,  then 

n  n 

(2)  max  E  y^  =  T/  ^^yi  f 

■)  j.i  “ 
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where  P  is  the  set  of  all  permutations  of  the  integers  (1,2,  ...  ,  n). 
This  result  implies  that  the  maximum  sum  is  achieved  when  the  smallest 
of  the  x's  and  y’s  are  paired,  the  next  smallest  of  the  x's  and 
y's  are  paired,  and  continued  until  the  largest  of  the  x's  and  y's 
are  paired. 

The  following  notation  will  now  be  introduced:  Let 

f(Pl,Pg,  ...  ,  p^)  =  Total  expected  reward  under  an  optimal 

policy  when  the  probabilities  are 

Pl^Pg'  *“  *  ^n* 

f(Pl,P2,  ...  ,  p^|x)  =  Total  conditional  expected  reward  given 

=  X  under  an  optimal  policy  when 
the  probabilities  are  P2^>P2>  •  •  •  »  P^^* 

That,  in  fact,  optimal  policies  exist  can  be  shown  by  induction. 

Denote  by  the  cumulative  distribution  function  of  the  random 

variable  X.  It  is  assumed  that  Xj^,X2,  ...  ,  X^^  are  independent 
identically  distributed  random  variables  with  CDF  Gjj(z),  and  that 

p  *  E(X)  =  j  zdGjj(z)  <  ». 

The  optimal  policy  is  embodied  in  the  following  theorem  which 
will  be  proven  by  Induction. 


Theorem  1.  For  each  n  >  1,  there  exist  numbers 


=  a^  <  a,  <  a_  < 
0,n  ■  l,n  as  2,n  «= 


<  a  =  +00 
=  n,n 


such  that  whenever  there  are  n  stages  to  go  and  probabilities 

Pi  *  P2  =  ’ '  *  =  ^n  optimal  choice  in  the  initial  stage 

is  to  use  p^  if  the  random  variable  X^  is  contained  in  the  interva] 
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(a,  a.  J.  Tue  a,  iepend  on  but  are  independent  of  the 

1-1, t.  1,11  i,ii  X 

p'ii,  Fui  tt.vrrnoro  a.  is  the  expected  value,  in  an  (n-l)  stage 

1,11 

j.rubJ,t;i!i,  oi  the  quantity  to  which  the  i^^  smallest  p  is  assigned 
(ajsuiniiit  an  optimal  policy  is  being  followed),  and 


(j)  =  2  Piq,„  fop  “11  PiSPsS  •••  £  V: 

1=1 


Proof.  A  proof  by  induction  is  employed.  Suppose  that  there  exist 

numbers  (a,  m  r  1,2,  ...  ,  n-l  such  that  the  optimal  policy 

j  ,m  j  =  i 

in  an  m  stage  problem  is  to  initially  use  the  i  smallest  p  if 

the  initial  value  is  contained  in  the  interval  (a,  ,a  ],  where 

i-i,m  i  ,m 

a  =  -00  and  a  :=  ®.  Then,  in  the  n  stage  problem  where  p, 
0,m  m,ra  °  k 

is  selected  first 


(^)  f(p^iP2.- •  • =  max[xpj^  +  f(Pj^,P2,...,Pj^_^>Pj^^^,...,Pj^)]  . 

K 

Kowever,  by  the  induction  hypothesis.,  it  follows  that  the  optimal 

policy  for  an  (n-l)  stage  problem  is  independent  of  the  (n-l) 

values  of  p.  Hence  defining  a.  as  the  expected  value  (under 

1  ,n 

th 

the  optimal  policy)  of  the  quantity  to  which  the  i  smallest  p 
is  assigned  in  the  (n-l)  stage  problem,  the  total  expected  reward 
'f  that  problem  is  given  by 


fiPj^.Pp. 


n-l  _ 

^n-1^  ”  ^i^i,n  * 
1=1 


:or  every  Pi  <  ^  ^  Pl^P2^  *'■  '  ^n-l 

t;!e  remaining  (n-l)  p's  of  the  original  n  p's  after  the  first, 


I  t  . 


is  chosen  in  the  n  stage  problem).  Furthermore,  since 


a.  is  independent  of  the  p's  and  other  policies  are  obtained 


permuting  the  p^s, 


any  sum  of  the  form  J)  Pj  „ 

i=l  •’i 


(where 


by 


Jl,  jg,  '  ’  ’  '  ^n  1  permutation  of  the  integers)  can  be  obtained 

for  the  total  expected  rewari.  of  the  (n-l)  stage  problem.  Hence, 
using  Hardy's  theorem  (lemma  l)  it  follows  that 


(6) 


a 


l,n 


•  •  * 


<  a 


n-l,n 


since  by  the  induction  assumption  f(pj^,P2,  ...  ,  must  be  a 

maximum. 

Using  the  results  of  (5)  and  (6),  equation  (4)  can  now  be  expressed 


as 


(7)  f(P3^>P2>-««iPnU)  =  max 

Ic 


S  *  ill 


Again,  using  Hardy's  theorem  (lemma  l). 


k*-l 

n 

*i?i 

*^i  i,n 

+  2^  1  n  > 

i.k*+l  ^ 

a^ 

a  = 

+oo) 

0,n 

n,n 

a,  „  T  <  X  <  a 
k-x-l,n  ■*  k’*,n 


This  result  follows  because  the  p's  and  a's  are  ordered  so 

that  if  X  is  greater  than  or  equal  to  the  (k*-l)  smallest  a, 

then  the  corresponding  p  (i.e.,  Pj^^)  must  be  greater  than  or 

equal  to  the  (k*-l)  smallest  p.  Hence,  the  first  choice  in  an  n 

stage  problem  is  to  choose  p.  if  x  c  (a  ,a  ].  Noting  that 

X  x*x^nx^n 

the  result  is  trivial  for  n  =  1  completes  the  induction.  Equation 
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( ';)  I’oliow.i  Imin- lilat.fly  from  equation  (5)i  and  the  theorem  is  complete. 

riM  ort-m  1  presents  the  form  of  the- optimal  policy,  but  does  not 

InllcaLe  t.  w  to  ibtaLn  the  a,  .  The  constants  may  be  calculated 

l,n  ^ 

from  tin*  r*. suits  of  Corollary  1. 


Cofojlary  1.  Peflne  a^  ^  -«*,  a  =  ■♦«>.  Then 
- *■ —  0,n  *  n,n 


f\,n 


i.l,n 


for  i  =  1,2,  ,  n,  where  -«  •  0  and  »  •  0  are  defined  to  be  0. 


Proof.  The  result  follows  by  recalling  that  a^  is  the  expected 

value,  in  an  n  stage  problem,  of  the  quantity  to  which  the  i^ 
smallest  p  is  assigned.  The  r -suit  then  follows  by  conditioning 
on  the  initial  x,  and  recalling  that  p^^  is  used  if  and  only  if 
this  value  lies  in  the  interval  (a.  ,  ,a  ]. 

The  previous  results  assume  that  the  X's  are  independent, 
identically  distributed  random  variables.  An  alternative  set  of 
conditions  leads  to  the  following  theorem. 


Theorem  2.  Suppose  that  the  successive  values  X^,X2f  •  •  •  ^  form 
a  sub-martingale,  i.e., 

Elxjx,,  X-,,  ...  ,  X,  J  >  X,  ,,  for  all  J  >  2  , 

J  1  0-1  =  0-1  = 

tl  n  th-r  optimal  policy  is  to  use  then  Pg,  ...  ,  and  finally 

p  ,  whenever  p,  <  p-  <  •  •  •  <  p  . 

r)-ooL .  Again,  a  proof  ly  Induction  is  employed.  The  result  is  trivial 
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for  n  s  1.  AsBume  It  Is  true  for  all  m  <  n-1.  For  the  n  stage 
problem  where  Is  selected  equation  (4)  still  holds.  However^ 
by  the  induction  hypothesis,  the  optimal  policy  is  specified  for  the 
(n-l)  stage  problem,  and  the  total  expected  reward  for  this  optimal 
policy  in  the  (n-l)  stage  problem  can  easily  be  expressed  in  terms 
of  the  conditional  expectations  of  the  ensuing  X's  given  =  x. 

Hence,  equation  (4)  reduces  to 


(9) 


f(Pl»  •  •  •  j  P^U)  = 

k 


r  k-i 

n  *1 

+  ^  P.E[XJX,  =  x]  . 

i=k+l  ^  ^  J 


Using  the  properties  of  sub-martingales,  it  follows  that  E[X^lXj^  =  x] 
is  monotone  increasing  in  i,  for  i  >  1.  Again,  using  Hardy's 
theorem  (lemma  l),  it  follows  that 


(10) 


f(Pl. 


Poi 


P„|x)  = 


xp^  + 


n 

2 

U2 


PiE[X^ 


X,  =  x] 


and  the  induction  is  complete. 

It  can  be  remarked  that  if  the  successive  values  X^^.Xg*  •••  ^ 
form  a  super-martingale,  i.e.. 


E[Xjlx^,  Xg,  ...  ,  Xj_^J  <  Xj_^  for  all  j  >  2  , 

then  the  same  reasoning  shows  that  the  optimal  ordering  is 

p  ,p  Pt  whenever  p^  <  p„  <  •  •  •  <  p  . 

^n'^n-1'  ^  *^1  *^1  =  *^2  =  =  ^n 

3.  Allocation  of  p's 

In  the  previous  sections  it  was  assumed  that  the  p's  were  a 
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fixed  set  of  given  numbers.  An  extension  is  to  allow  the  p*s  to  be 

determined  in  some  optimal  fashion.  In  the  context  of  the  stochastic 

sequential  assignment  problem,  a  company  has  the  opportunity  to  hire 

skilled  men,  i.e.,  those  having  large  p's,  but  at  the  expense  of 

large  salaries.  In  particular,  suppose  that  c(p)  denotes  the  cost 

to  retain  a  man  having  an  associated  p.  Let  ^  ^'n 

th 

the  expected  value  of  the  quantity  to  which  the  i  smallest  p  is 

assigned  (these  a's  are  the  a^  n+l*'^  Section  2  except  that 

the  second  subscript  is  surpressedj  since  only  an  n  stage  problem 
is  being  considered  no  confusion  should  result).  Then  the  appropriate 
total  expected  reward  for  a  given  allocation  •**  >  where 

<  P2  1  "  ■  S  Pn>  given  by 

(11)  tp(p^,p2,...,p^)  .  .^pj  .  c(pj)  .  [ajPj.c(pj)]  . 


The  problem  can  now  be  stated  as  follows; 

maximize  9(p^,  Pg^  • • • 


subject  t.o 


(12)  ®  =  Pi  =  i  =  1,  2,  . . .  ,  n 

and 

ru)  Pi  ^  P2  =  =  Pn  ^ 

and  solutions  are  presented  for  the  following  five  cases. 


Ca 


If  cfp)  -  c  •  p  with  c  >  0,  then 


10 


•  •  • 


1«1 


It  is  evident  that  cp(pj^,P2,  . . .  ,  p^)  is  maximized  subject 
only  to  (12)  if  Pj^  =  1  when  a^  -  c  >  0,  and  =  0  when  a^-c  <  0, 
However,  it  has  already  been  shown  in  expression  (6)  that 

<  Bg  <  ■  •  •  <  Hence ^  choosing  i*  so  that  it  is  the  smallest 
integer  such  that  "  c  >  0  (if  all  the  a^  -  c  <  0,  then  i* 
may  be  interpreted  as  equal  to  n+l)  it  follows  that  the  optimal 
values  of  p^  subject  to  (12)  are 


and 


p^  =  0,  for  i  <  i* 
p^  =  1,  for  i  >  i*  . 


However,  this  solution  also  satisfies  (ij)  so  that  it  is  a  solution 
to  the  problem.  Note  also  that  the  a^'s,  and  hence  i*,  are 
calculable  by  corollary  1. 


Case  2; 


If  c(p)  =  cp  +  bp  where  c  >  0  and  b  >  0,  then 


n 


cp(p^,  p^,  ...  ,  p^)  -  ^  [(a^-c)p^  -  bp^] 


2  . 


Each  (a^-c)pj^  -  bp^^  is  maximized  at 


Pi  =  -b- 


,  for  (a^-c)  >  0 


and 


Pj  =  0  ,  for  (a^-c)  <  0  . 
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Therefore,  the  optimal  values  of  subject  to  (12)  are 


and 


p^  -  0  ,  for  i  <  1* 


for  i  >  i*  , 


where  is  the  smallest  integer  such  that  a^  -  c  >  0.  If  all  the 
-  c  <  0,  then  i*  may  be  interpreted  as  equal  to  n+1.  Nbte 
that  this  solution  also  satisfies  (13)  so  that  it  is  a  solution  to 
the  problem. 


Case  3: 

If  c(p)  sati  sfies  c(o)  =  0  and  c(p)  is  non-decreasing  and 
convex,  then 


P2^  •  •  • 


n 


i=l 


[aiPi 


c(Pj^)]  . 


Using  the  same  argument  given  for  Case  2  and  noting  that  a^p^-c(p^) 
i  ■  concave  in  p,  the  optimal  solution  takes  on  the  form 


and 


p.  =  0  ,  for  i  <  i* 

■^1 

=  min  (p^,l),  for  i  >  i*  , 


where  i*-  is  the  smallest  integer  such  that  a  -  c'(o)  >  0  and 
p»  satisfies  a^  -  c’Cp*)  =  0.  If  all  the  a^  -  c'(0)  <  0,  then 
i*  may  be  interpreted  as  equal  to  n+1. 


Case 

If  c(pj  satisfies  c(o)  -  0  and  c(p)  is  non -decreasing  and 
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concave,  then 


n 

<P(Pl»  P2»  •••  *  * 

Following  the  same  argument  as  in  the  two  preceding  cases  and  noting 
that  a^Pj^  -  c(p^)  is  convex  in  p,  the  optimal  solution  assumes 

the  form 

=  0,  for  i  <  i* 

and 

p^  =  1,  for  i  >  i*  , 

where  i*  is  the  smallest  Integer  such  that  a^  -  c(l)  >  0.  If  all 
a  -  c(l)  <  0,  then  i*  may  be  interpreted  as  equal  to  n+1. 


Case  3: 

This  case  will  be  concerned  with  the  allocation  of  the  p's 
when  the  p’s  can  take  on  only  a  finite  set  of  possible  values 

7r»  {TT^,  iTg,  ...  ,  with 

The  allocation  problem  can  now  be  written  as  follows 


subject  to 


and 


(14)  Pi  e  n-  . 

This  is  the  original  allocation  problem  with  expression  (12)  being 
replaced  with  (l4).  The  four  cases  considered  for  the  original  problem 
will  be  considered  for  this  new  model. 
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Case  1'  -  linear  cost  function; 


The  arguments  are  identical  to  those  presented  for  Case  1  up 
to  and  including  the  expression  for  determining  i*.  However,  for 
1  <  i^,  the  p^  should  be  chosen  as  small  as  possible,  whereas  for 
i  >  i»,  the  should  be  chosen  as  large  as  possible.  Therefore, 
the  optimal  values  of  p^  subject  to  (ll^)  are 

p^  =  TTj^,  for  i  <  i* 
and 

for  i  >  i*  . 

Again,  this  solution  satisfies  (15)  so  that  it  is  a  solution  to  the 
problem. 


Case  2*  -  quadratic  cost  function: 

The  arguments  are  similar  to  those  presented  for  Case  2,  with 
i*  determined  as  in  Case  2.  The  optimal  values  of  p^  subject  to 
(14)  are 

p.  =  ,  for  i  <  i* 

1 


Again  this  solution  satisfies  (13)  so  that  it  is  a  solution  to  the 
problem. 

Case  y  -  convex  cost  function; 

The  arguments  are  again  similar  to  those  presented  for  Case  3, 
with  i*  determined  as  in  Case  3«  The  optimal  values  of  p^  subject 
to  (14)  are 


p.  -  TT., ,  for  i  <  i* 
1 


TT^,  if  p^  <  in  case  3; 

TT^,  if  p^  =  TT^,  r  =  l,2,...,k,  in  Case  3> 

either  or  ^i’  Case  3>  for  i  >  i* 

satisfies  ^  r  <  k; 

TTj^,  if  p^  >  TTj^  in  Case  3; 


Again,  this  solution  satisfies  (13)  so  that  it  is  a  solution  to  the 
problem. 


Case  4'  -  concave  cost  function: 

The  arguments  are  again  similar  to  those  presented  for  Case  4 
with  i-»  determined  as  in  Case  4.  The  optimal  values  of  p^  subject 
to  (l4j  are 


and 


p,  =  TT^ ,  if  p.  =  0  in  Case  4 

1  1 

p.  =  tt,  ,  if  p.  =  1  in  Case  4  . 

k'  *^i 


Again,  this  solution  satisfi' s  (13)  so  that  it  is  a  solution  to  the 
problem. 
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.  Examp  lf 

In  the  context  of  the  stochastic  sequential  assignment  problem, 
suppose  there  are  four  men  available  to  perform  four  Jobs  occurring 
in  sequential  order  Each  man  has  an  associated  and  is  labeled 
so  that  5  P2  £  P3  5.  Pi4.‘  '*^yP®  asstimed  to  be  a 

uniformly  distributed  random  variable  over  the  range  (0,1000),  i.e., 


0  ,  for  z  <  0 


Gx(z)  =/  z/lOOO,  for  0  <  2  <  1000 


z  >  1000  . 


Using  this  information,  and  equation  (8),  the  required  a^^^  are 


obtained  as  follows; 


(11)  <a 


[y/lOOOldy  =  500 


= 


a  =  -oo 

0.5 


(,  i  i  i  ) 


ri,2 

^  J  [y/l000]dy  +  a^^2[l  -  G^^Ca^^g)  ] 
.  ^  /  y/1000  dy  +  ®i^2^X^®l,2^  ~ 


a  ,  = 

I  ’’’ 


IV )  \  a 


‘0.4  = 

•  CX.' 

1.A 

L 

r®2,3 

2,k  " 

i 

l;-3 

f 

-'®2,3 

[y/lOOO)dy  +  =  50J+.6875 


ly/lOOO]dy  +  ^ 


[y/1000]dy  +  ®2,3S^®2^3^  ^  695.3125 


Suppose  that  the  first  job  to  come  in  is  a  ^800  job.  The  optimal 

policy  calls  for  assigning  the  "best"  man  to  this  job,  i.e., 

since  it  lies  in  the  interval  (695* 3125»“) .  Suppose  that  the  next 

job  to  arriv'e  is  a  $450  job.  There  are  now  5  men  available  and  the 

optimal  policy  calls  for  assigning  man  2  to  this  job,  i.e.,  Pg, 

since  it  lies  in  the  interval  (575,625]  Suppose  that  the  next  Job 

to  arrive  is  a  $400  job-  There  are  now  2  men  available  and  the  optimal 

policy  calls  for  assigning  man  1  to  this  job,  i.e.,  p^,  since  it 

lies  in  the  interval  (0,500]  The  remaining  man,  man  5  (associated 

with  p^)  is  then  available  for  the  last  assignment. 

It  should  be  noted  that  the  assignment  did  not  depend  upon  the 

values  of  the  p's  but  only  on  tne  ordering.  Suppose  that  the  choice 

of  p's  are  available  to  the  decision  maker,  and  the  cost  to  retain 

.  .  2 

a  man  having  an  associated  p  is  given  by  c(,p,'  =  50p  +  300p  . 

The  results  for  Case  2  of  Section  3  are  then  applicable.  It  is 

necessary  to  determine  the  a,.  Recall  that  a.  =  a.  ^  so  that 
^  i  1  i,n+l 

the  a^  ^  are  required.  These  are  as  follows: 
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ne  characteriMtlon  of  the  form  of  the  optimal  asalgnment  policy  la 


embodied  in  Theorem  J. 


Theorem  j.  Assume 


that  r(p,x)  is  differentiable  and 


„  ,  „  Ti  n  there  exist  numbers 

for  each  P^>P2>  •  •  •  ^ 


-00  =  8 


<  a,  ^  „  <  •  •  •  <  ®n-l,n  ^  ®n,n  "  ' 


O^n  IjH  =  2,n 


such  that  whenever  there  are  n  atages  to  go  and  prohabllltles 
„  <  n  <  ■  ■  •  <  p  than  the  optimal  choice  In  the  Initial  stage  Is 

to  use  p^  If  the  random  variable  X  is  contained  In  the  Interval 
(Vl,n’‘l,n‘- 

n  and  X  >  X  »  expression  (l8)  indicates 
Proof.  For  any  Pg  >  P^  ^2  ^ 


•'tJ- 


-Xi  -Pi 


or  equivalently 


■(p  ,x  )  -  rtp^.Xg)  >  rCPj.Xi)  -  rtP^.x^)  . 


Now,  let 


X*  =  suplx:  f(Pi>  P2^  “  rCp^yx)  +  f(P2; 


and  let  x*  be 


if  the  above 


set  is  vacuous.  (Note  that 


•  t  Pjj)  ffiay  easily  be  shown  to  be  continuous  by  induction, 
anl  heiiL'  ,  it'  x»  Ls  finite  then  the  supremum  is  actually  a  maximum.) 
.iuppost  X  -  X*  I  it.-n  let  x  e  (x,x*]  be  such  that  f (p^^iPg, . . .  ,p^|x) 
r''p^,x,  ♦  f^p2 'P^*  ■  •  *Pj, )  •  Now  for  any  j  >  1,  expression  (19) 

can  i)  ■  wr  L t,t.-  n  as 

ri'pyx^)  *  r(p^,x^}  >  r(pj,x)  -  r(Pj_,x)  , 

Of  ai  1  .-mat ively , 

r(p^.x;  +  f(p^,...,p^)  -  r(p^,x)  -  f’(Pi»P2>---‘Pj.i>Pj+i»*-->Pn^ 

>  r(p^,x^)  +  f(Pp...-.P^)  -  -  f(Pi>P2>- • -^Pn^  ' 

However,  the  right-hand  side  of  the  inequality  must  be  greater  than 
or  '.'quai  to  zero  since 

r(Pi,t,  *  f(p,,.  ,  p„) » f(Pi.  ,  pJJp  • 

iioMce,  for  X  <  X* 

X 

•n  p^.x,  e  f(p^,. , .  ,p^)  >  r(p^,x)  f’(Pi*P2^- • '^Pj.l^Pj+l^*  * ’’Pn^  * 

THer(_  fore-  it  fcliows  tr.at  the  optimal  policy  uses  p^,  if  and  only 
if,  X  <  xt.  By  defining  as 

XV  -  sapix  >  xri  f:p^  p^,. .  .  ,p^jx)  =  r(p2,x)  +  f  (pj^,Pj, . . .  ,p^) }  , 

i'  I'oii owj  from  tne  same  reasoning  that  the  optimal  policy  uses  Pg, 
ail!  oniy  li,  x  r  Similar  reasoning  completes  the  proof. 

rai  'omm'.  nt.s  are  in  ord'=-r. 
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<4 


i)  Although  Theorem  3  appears  to  be  similar  to  Theorem  1,  it 
differs  in  that  the  a's  are  not,  in  general,  independent  of  the 


p's,  nor  are  the  a's  easily  calculable. 

ii)  For  the  allocation  problem  in  the  general  assignment  model, 
a  result  similar  to  that  given  for  Case  4  may  be  obtained.  In  parti¬ 
cular,  if  c(p;,.  the  cost  to  retain  a  man  having  an  associated  p, 
is  concave,  and  r(p,x}  is  convex,  then  the  optimal  p's  are  either 
zeros  or  ones.  This  follows  by  showing  that  fCPj^^Pg,  •••  >  P^)  is 
convex  (in  the  vector)  so  that  the  objective  function 

n 

f(Pl.  Pj.  •••  >  P^)  -  c(p^) 

i=l 

is  convex.  The  result  that  f(p^,P2#  •••  >  P^)  is  convex  is  embodied 
in  Lemma  2. 

Lemma  2.  If  for  all  x,  r(p,x)  is  convex  in  p,  then  f  (p^jPg*  •  • .  iP^^) 
is  convex  (in  the  p  vector). 

Proof.  A  proof  by  induction  on  the  number  of  terms  in  the  p  vector 
is  employed. 

For  any  fixed  x,  it  will  be  shown  that  f(p^;Pg,  ...  ,  Pj^K) 
is  convex.  From  (17').,  f(p^ix}  is  convex.  Assume  that 
f(p^,P2,  .  .  ,  Pjjjjx)  is  convex  for  all  m  <  n-1.  It  follows  that 

(20)  x(Pj^.x)  ^  f(p^,  pg,  ...  ,  ...  ,  p^; 

is  convex  for  all  k  -  1,2,  ...  ,  n.  Hence,  the  maximum.,  over  k, 
of  ‘=xpression  (20)  is  also  convex  since  the  maximum  of  a  finite  number 
of  convex  functions  is  also  convex.  Using  equation  (15)> 
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f‘(p^^P2>  •••  »  convex.  The  lemma  is  then  proved 

by  employing  equation  (l6).  It  can  be  noted  that  a  similar  proof 
can  be  used  to  show  that  f(p^,P2,  ...  ,  p^)  is  monotone  (in  the 
p  vector)  if  r(p,x)  is  monotone  in  p. 

iii)  Throughout  this  paper  it  has  been  assumed  that  the  number 
of  assignments  is  equal  to  the  number  of  men.  This  restriction  can 
be  relaxed  easily.  Let  ra  denote  the  number  of  assignments  and  let 
n  denote  the  number  of  men.  If  n  >  m,  choose  only  those  m  men 
having  the  highest  p's  associated  with  them  (assuming  that  r(p,x) 
is  non-decreasing).  If  n  <  m,  add  (n-m)  "pseudo  men"  having  p's 
equal  to  zero  associated  with  them  (assuming  r(0>x)  =  O). 
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an  Indkallon  of  the  aiililory  aecurity  claaalficaiion  of  the  in- 
fonaatien  in  the  paragraph,  wproaontad  aa  <T$),  ($).  fCJ,  or  fl/j. 

There  la  no  limitation  on  tho  length  of  the  abetraet.  How- 
avor,  tha  auggoalad  length  ia  from  ISO  to  22S  wordi. 

14.  EEY  VORDS:  toy  worda  are  technically  meaninghil  tonna 
or  abort  pbraaoa  that  characterka  a  report  and  may  be  uaad  aa 
Indea  enirioa  for  cataloging  tho  report-  Key  worda  muat  ba 
aelectod  ao  SmI  no  aecurity  claealficatlon  la  required.  Idanti-  - 
fkra,  auch  aa  agulpmant  atodol  doaignation,  trade  nama,  military 
project  coda  name,  geographic  location,  may  be  uaed  aa  key 
worda  but  will  ba  followed  by  an  indication  of  technical  con- 
teat.  The  aaalgnmant  of  linka,  lulaa,  and  waighia  ia  optlenul. 
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